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5 

BACKGROUND OF THE INVENTION 

The present invention relates to data networking and more particularly to systems 
and methods for protecting against failure. 

Hierarchical digital transmission standards such as SONET, SDH, and G.709 are 
10 commonly used in optical service provider networks. These standards define multiple 
layers of a transmission hierarchy where each layer represents transmission at a particular 
data rate. Multiple lower bandwidth transmission signals of one hierarchical layer can be 
multiplexed together to form a higher bandwidth transmission signal at a lower 
hierarchical layer. A higher hierarchical layer thus contains signals that are more finely 
1 5 granulated in bandwidth than a lower layer. 

To provide high quaUty of service, networks employing such digital transmission 
standards require mechanisms to quickly respond to failures. To avoid disruption of data 
flow and preserve the user experience of voice and video services, it is desirable to 
reroute traffic affected by a link failure within 50 milliseconds. Numerous protection 
20 mechanisms have been developed in pursuit of this ideal. 

Increasingly hierarchical digital transmission standard-based networks are being 
used to carry packet-based traffic such as IP traffic and also MPLS traffic. The 
hierarchical digital transmission standards can be said to operate at the optical layer 
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(which includes the hierarchical transmission rate layers referred to above), whereas IP 
and MPLS are client layer protocols that exploit optical layer communication facilities. 
MPLS Traffic Engineering is an increasingly important technique for carrying traffic for 
5 which quality of service must be guaranteed. A set of techniques referred to as MPLS 
Fast Reroute has been developed to provide failure protection for MPLS Traffic 
Engineering tunnels. These protection mechanisms operate at the client layer whereas 
the protection mechanisms associated with the hierarchical digital transmission standards 
referred to above are associated with the optical layer. 

10 Problems arise due to interactions between failure protection mechanisms 

operating at the different layers. It is desirable to inhibit the operation of optical layer 
protection mechanisms for traffic that is already protected at the chent layer by a 
mechanism such as MPLS Fast Reroute. One way to accomplish this differential 
protection at the optical layer is to segregate protected traffic and unprotected traffic 

1 5 among different fibers or different wavelengths. This is extremely wastefiil of 
transmission capacity. 

Another prior art approach allows protected and unprotected traffic to share a 
link. When a failure occurs traffic is rerouted only for those paths that both employ the 
link and carry protected traffic. The traffic is rerouted at the endpoints of the path rather 
20 than at the point of failure. This approach requires the path restoration signaling to be 
done on a per-path basis between the points of failure and the path endpoints. Because 
there may be nimierous protected paths with disparate endpoints, the signaling burden 
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makes it difficult to achieve the objective of achieving rerouting of protected traffic 
within 50 milliseconds. 

Yet another approach, limited to a ring topology, accomplishes its signaling at the 
5 link layer to reduce signaling traffic, but provides end-to-end restoration of protected 
paths. This approach relies on flooding signaling information around the ring and thus 
will not work in a mesh network where such flooding is impractical. Also, restoration is 
slowed somewhat by the need to signal failure all the way to the path endpoints. 

What is needed are systems and methods for providing differential protection to 
10 traffic at the optical layer without incurring the drawbacks discussed above. 
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Summary Of The invention 

Embodiments of the present invention provide very fast protection switching 
while leaving selected traffic unprotected. When a link fails, only certain pre-selected 
5 paths are rerouted at the point of failure. Other traffic is left unprotected or protected at 
another layer. Signaling of the failure and the rerouting is performed for the link as a 
whole rather than for the individual paths. Signaling occurs at a first layer of a digital 
transmission hierarchy while path rerouting switching occurs at a second hierarchical 
layer above the first layer. In certain implementations, K1/K2 bytes are used in signaling 
10 at the optical line layer (OC-n), while only protected High-Order paths (STS) and Low- 
Order paths (VT) are protected by the protection switch. 

One aspect of the present invention provides a method of operating a node to 
handle link failure in a network employing a hierarchical digital transmission standard. 
The method includes: detecting failure of a data communication link, wherein the link is 
15 employed by a plurality of paths defined at a first hierarchical layer, signaling local repair 
of the failure using overhead information of a second hierarchical layer below the first 
hierarchical layer; and switching only protected ones of the plurality of paths to alternate 
routes through the network to avoid the failure. 

Further understanding of the nature and advantages of the inventions herein may 
20 be realized by reference to the remaining portions of the specification and the attached 
drawings. 
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Brief Description Of The drawings 

Fig. 1 depicts a mesh network to which embodiments of the present invention 
may be applied. 

5 Fig. 2 depicts a network device useful in implementing embodiments of the 

present invention. 

Fig. 3 is a flow chart describing steps of a mesh network protection scheme 
according to one embodiment of the present invention. 

Fig. 4 depicts a mesh network failure scenario according to one embodiment of 
10 the present invention. 

Fig. 5 is a signal flow diagram illustrating signaling according to one embodiment 
of the present invention. 
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DESCRIPTION OF SPECIFIC EMBODIMENTS 



The present invention will be described with reference to a specific example, a 
mesh network employing a hierarchical digital transmission standard. In the example as 
5 described herein, the hierarchical digital transmission standard is the SONET standard 
although the present invention is also readily applicable to other well-known standards 
such as SDH and G.709. Transmission in accordance with SONET, SDH, or G.709 is 
described in various standards and specifications published by the ITU, ETSI, ANSI, and 
Telcordia (formerly Bellcore), the contents of which are herein incorporated by reference 
10 in their entirety. 

A mesh network to which embodiments of the present invention may be applied is 
depicted in Fig. 1. A mesh network 100 includes nodes A-I interconnected by links 1-12. 
Each link is assumed to be bidirectional In the implementation to be described herein, 
each link represents OC-n, e.g., OC-48, communications, i.e., a payload data rate of 

15 approximately 2.4 Gbps. The OC-48 signal transmitted along the link includes 48 
multiplexed STS-1 signals with a payload of approximately 50 Mbps each. Data is 
transferred through the mesh network over so-called paths. Each path traverses a nxunber 
of links and nodes in the network. In the example to be described, multiple paths may 
incorporate the same link, each being assigned a different STS-1 channel. On a given 

20 link, some STS-1 channels may be reserved for end-to-end paths while others are 

reserved for carrying traffic that is being rerouted around a link failure. This example is 
merely representative. For example, the link may be, e.g., an OC-192 or OC-768 link. 
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Three representative paths employ STS-1 channels on link 2 from node B to node 
C. Paths ABCDH and ABCD are protected while path EFBCD is not protected. 
Counterpart paths in the reverse direction are also present. Of course this will be 
5 understood to be a simpUfied example. It is possible to have many more protected paths 
and/or unprotected paths employing a single link. A pre-configured protection route for a 
failure of link 2 extends through nodes B, F, I, G, and C. 

Fig. 2 depicts a representative network device 200 that may be used to implement 
nodes of mesh network 100. A network interface 202 is provided for each link to which 
10 the node connects. Each network interface 202 incorporates the necessary optical 
interface components including an appropriate laser and photodiode, circuitry for 
recovering digital data streams and for modulating digital data onto a transmitted signal, 
and appropriate circuitry for mapping STS-1 data streams into an OC-48 signal and for 
demapping STS-1 data streams from a received OC-48 signal. 

1 5 Individual STS-1 signals received and transmitted by the network interfaces 202 

are interconnected by a cross-connect 204. Cross-connect 204 is capable of connecting 
any received STS-1 signal to any desired STS-1 slot on any transmitted OC-48 signal. 
The cross-connection state of cross-connect 204 is configured by a processor 206. 

Processor 206 also receives and generates overhead information that may be 
20 included with the individual STS-1 signals as well as the overall OC-48 signals. 

Processor 206 controls signaling and switching operations to implement embodiments of 
the present invention. Processor 206 executes software that would be stored in a 
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computer-readable storage medium such as a memory 208. Other examples of a storage 
medium that may hold instructions for execution by processor 206 include, e.g., CD- 
ROMS, DVD-ROMS, floppy disks, a signal received over the Intemet, etc. FunctionaUty 
5 of processor 206 may be divided among multiple processors or may be implemented all 
or in part by integrated circuits such as FPGAs, ASICs, etc. 

Fig. 3 depicts a protection scenario useful in describing embodiments of the 
present invention. Fig. 3 depicts the same mesh network 100 as in Fig. 1, with a failure 
on link 2, The previously mentioned three representative paths are impacted by the 
10 failure on link 2 from node B to node C. Paths ABCDH and ABCD are protected and 
have been rerouted along the preconfigured protection route to circumvent the failed link 
2, while path EFBCD remains routed along link 2 since it is not protected. 

Fig. 4 is a flow chart describing steps of operation according to one embodiment 
of the present invention. At step 402, the paths to be protected are pre-configured as 

1 5 protected paths at the nodes they traverse. This may be part of the process of distributing 
and configuring path information at the various nodes. Li one embodiment, a path 
computation server (not shown) computes the needed paths using an appropriate 
algorithm such as ones based on the well-known Dijkstra algorithm. The path 
computation server tells each node which paths are traversing it, which links are being 

20 employed for each path, and which channel is being used for a given path on the inboxmd 
link and the outbound link. Each node is also made aware of which traversing paths are 
to be protected and which are to be left unprotected. 
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Step 404 pre-configures backup routes for the paths to be protected. Each link 
with protected paths has a protection route (typically one but possibly more) onto which 
protected traffic will be locally diverted in case of a failure. These protection routes may 
5 be computed by the path computation server referred to above and sent to the nodes of 
the mesh network. Altematively, the protection route may be computed locally based on 
an understanding of the mesh network topology. Specifying the protection route will 
involve specifying the included nodes and links and the STS-1 channels to be used on 
each link. Sufficient aggregate bandwidth should be allocated to accommodate all of the 

10 protected traffic. For example, the protection route may reserve as many channels on 
each included link as there are protected paths. An individual channel may be allocated 
to multiple protection routes configured to handle independent failures. The endpoint 
nodes and intermediate nodes of a protection route should be aware of the channel 
assignments so that they can appropriately configure their cross-connects in the event of a 

1 5 failure. The protection routes implement local repair around a failed link rather than an 
end-to-end reroute of a path including a failed link. 

In normal operation, traffic flows through the protected and unprotected paths. 
Assume that a failure occurs that disrupts all of the communication on link 2 between 
nodes B and C. This failure is detected at step 406. An indication of failure is then 
20 signaled between nodes B and C at step 408. The signaling traffic goes via the protection 
route. The signaling is performed within the OC-48 overhead of the links along the 
protection route rather than within the overhead of individual STS-1 channels. Nodes B 
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and C are aware of which paths will be rerouted in response to the failure based on the 
pre-configuration of step 402. This greatly saves on the necessary signaling resources. 

A step 410 represents the rerouting of the protected paths on the failed link. Node 
5 B, for example, will bridge the traffic of the protected paths onto the adjoining link in the 
protection route. For protected paths that it is receiving, node B will select from the 
adjoining link of the protection route, rather than from the failed link. Nodes along the 
protection route will begin selecting rerouted traffic from the previous link of the 
protection route and forwarding it onto the next link of the protection route. 

10 The rerouted traffic will exploit channels allocated to failure protection on the 

protection route. Multiple protection routes may be allocated the same channel or 
channels. Also, channels allocated to protection may be used to carry low priority traffic 
or manually routed traffic if there is no failure to handle. A prioritization scheme will be 
used to determine whether traffic to be rerouted will actually displace current traffic. The 

15 highest priority is allocated to traffic to be rerouted due to a detected signal failure (SF). 
Traffic to be rerouted due to detected signal degradation (SD) condition has a lower 
priority than SF traffic. Unprotected traffic flowing through a protection channel 
reserved for use of other protected traffic in case of a failure has a lower priority than SD 
traffic. Manually routed traffic has the lowest priority. 

20 The signaling and rerouting operations of steps 408 and 410 will be described in 

greater detail with reference to Fig. 5. Paths ABCDH and ABCD are protected and, in 
response to the failure, they are rerouted from the failed Unk 2 to instead traverse nodes 
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BFIGC. Fig. 5 shows the failure detection in rerouting signal flow according to one 
embodiment of the present invention. These signals are embedded in the K1/K2 bytes 
within the OC-48 overhead as known in the art. The format of the information 
5 communicated in these K1/K2 bytes is: [Action Code: Requesting Link ID: Bridged 
Path ID]. During normal operation in the absence of a failure there is a constant 
exchange of K1/K2 bytes on each link. In this normal mode the content of the bytes is 
[NR,0,0] where NR signifies ''No Request." 

In Fig. 5, when a failure (e.g., a fiber cut) occurs on the link from node C to node 
"10 B, node B detects a loss of signal (LOS). In response to the indicated LOS, node B 
begins the failvu-e and reroute signaling. Node B sends node C a message [SF,2,0] 
indicating a signal failure (SF) on link 2 and this message is sent along the protection 
route for link 2. AUematively, reroute may occur in response to a detected signal 
degradation (SD) which may be similarly signaled. Intermediate nodes that receive the 
1 5 [SF,2,0] configure their cross-connects to activate the protection route for the protected 
traffic flowing from node C to node B. Node C responds by bridging the protected traffic 
that it was sending on link 2 onto link 6. 

Node C then sends [RR,2,2]. This indicates a Reverse Request for node B to also 
bridge and that node C has akeady bridged the traffic originally slated for link 2. 
20 Intermediate nodes that receive the [RR,2,2] configure their cross-connects to activate the 
protection route for protected traffic falling from node B to node C. Node B then 
responds by selecting link 5 for the protected traffic it would have received on link 2 and 
by bridging the protected traffic it would have sent on link 2 onto link 5. 
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Node B sends [SF,2,2] to node B to report its own bridging action. Node C 
responds by selecting to receive the protected paths via hnk 6. For each bridging and 
selection operation, nodes B and C take action for all protected traffic affected by the 
5 failure and leave any unprotected traffic alone based on the previously configured 

protection status. The rerouting of numerous protected paths can be accomplished in this 
way with minimal signaling while preserving granularity in configuring protection. It 
will also be noted that the signaling is not flooded throughout the mesh network but 
rather flows only through the nodes of the protection path. The protection scheme, in 
10 contrast to ring-based schemes, does not rely upon knowledge of every path being 
distributed throughout the network. 

During the cross-connect configuration steps, care should be taken to avoid 
misconnecting traffic from the newly protected connection to a connection that was using 
the same bandwidth for a lower priority purpose (if such a lower priority connection 
1 5 exists). This can be done by dropping traffic at the rerouted path's endpoint when, e.g., 
the path trace identifier embedded in the STS-1 frame does not match what is expected. 

Altematively, the cross-connect configuration for both directions of transmission 
can be postponed to be triggered by the [RR,2,2] message instead of one direction being 
triggered by the [SF,2,0] message. In this altemative embodiment, the [SF,2,0] message 
20 only serves to disconnect prior connections that were using the required protection 
bandwidth, if their priority is lower than SF. 
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The above description assumes that signaUng occurs at the OC-48 layer of the 
transmission rate hierarchy whereas protection is configured at the STS-1 layer and 
switching and bridging operations occur at the STS-1 layer. This is only one possible 
5 implementation. For example, signaling may be accompUshed at the STS-1 layer using 
the overhead available there while protection may be tumed on or off for individual paths 
that employ VT1.5 channels within the STS-1 signal. The signaling is along the lines 
described above. In general, signaling can occur at any particular layer in a SONET, 
SDH, or G.709 hierarchy with protection being granulized to lower bandwidth 
10 constituent signals at some layer below the signaling layer. The actual bridging and 
selecting are then implemented at the lower layer only for the protected paths. 

It is understood that the examples and embodiments that are described herein are 
for illustrative purposes only and that various modifications and changes in light thereof 
will be suggested to persons skilled in the art and are to be included within the spirit and 
15 purview of this application and scope of the appended claims and their fiiU scope of 
equivalents. For example, the flowchart steps of Fig. 4 may be deleted, performed in a 
different order, or combined with other steps. Similarly, the signal flow of Fig. 5 may 
also be modified. 
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